Hybrid Methods for Continuous State Dynamic Programming

Authors

Paul L. Fackler

Mario J. Miranda

Abstract

We propose a method for solving continuous-state and action stochastic dynamic programs that is a hybrid between the continuous space projection methods introduced by Judd and the discrete space methods introduced by Bellman. Our hybrid approach yields a smooth representation of the value function while preserving the computational simplicity of discrete dynamic programming. The method is especially well suited for implementation in a vector processing environment such as MATLAB or GAUSS, and makes it possible to automate the setup and solution of continuous space dynamic programs in a way that previously seemed elusive.

Dynamic programming, while familiar and often the natural way to handle many decision problems, has not seen widespread application. In part this is due to the curse of dimensionality: problems with many state variables become too large to solve. Another reason, however, is surely the difficulty one faces in formulating models and solving them. The practice of dynamic programming seems to be the domain of specialists who program each problem anew. This need not be the case.

Discrete time dynamic programming problems, for which the state and policy variables can take on a discrete finite set of values, can be completely characterized by a discount factor and two matrices, one describing the current reward function and the other describing the state transition function. Solution methods in this case require maximization over the elements in each column of a matrix and simple matrix arithmetic (addition, multiplication and a linear solve operation). We propose here a similar method for problems with continuous state and action variables that is a hybrid between projection and discretization methods and that retains the best features of both. With this approach one can characterize a DP problem with a discount factor and three matrices, with the addition, for stochastic ...
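The discrete case described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function name, the matrix layout (one row of `f` per action, transition rows stacked action by action in `P`), and the two-state example are all assumptions made for illustration. For brevity the sketch uses plain value iteration, which needs only the column-wise maximization and matrix arithmetic mentioned above; the linear solve the abstract refers to would appear in a policy iteration variant, which is not shown.

```python
def solve_discrete_dp(f, P, delta, tol=1e-10, max_iter=10_000):
    """Value iteration for a finite DP (illustrative layout, assumed here).

    f[a][s]      : reward for action a in state s (m x n matrix)
    P[a*n + s][j]: probability of moving from state s to state j under action a
                   (m*n x n matrix, rows stacked action by action)
    delta        : discount factor in (0, 1)
    """
    m, n = len(f), len(f[0])
    v = [0.0] * n
    for _ in range(max_iter):
        # Q[a][s] = current reward + discounted expected future value
        Q = [[f[a][s] + delta * sum(p * vj for p, vj in zip(P[a * n + s], v))
              for s in range(n)] for a in range(m)]
        # Maximize over the elements in each column (one column per state)
        policy = [max(range(m), key=lambda a: Q[a][s]) for s in range(n)]
        v_new = [Q[policy[s]][s] for s in range(n)]
        change = max(abs(x - y) for x, y in zip(v_new, v))
        v = v_new
        if change < tol:
            break
    return v, policy


# Hypothetical two-state example: action 0 stays put, action 1 switches state.
f = [[0.0, 1.0],    # rewards for "stay" in states 0 and 1
     [-0.2, 0.0]]   # rewards for "switch" in states 0 and 1
P = [[1.0, 0.0],    # stay, from state 0
     [0.0, 1.0],    # stay, from state 1
     [0.0, 1.0],    # switch, from state 0
     [1.0, 0.0]]    # switch, from state 1
v, policy = solve_discrete_dp(f, P, delta=0.9)
print(v, policy)
```

In this toy problem the whole model is the discount factor plus the two matrices `f` and `P`, which is the sense in which the abstract says a discrete problem is "completely characterized" by them.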

Similar resources

Stochastic Dynamic Programming with Markov Chains for Optimal Sustainable Control of the Forest Sector with Continuous Cover Forestry

We present a stochastic dynamic programming approach with Markov chains for optimal control of the forest sector. The forest is managed via continuous cover forestry and the complete system is sustainable. Forest industry production, logistic solutions and harvest levels are optimized based on the sequentially revealed states of the markets. Adaptive full system optimization is necessary for co...

Improving the Energy Management of Parallel Hybrid Electric Vehicle by Dynamic Programming Using Electro-Thermal Model of Battery

In this paper, an offline energy management system (EMS) is proposed for parallel hybrid electric vehicles (HEVs). The proper energy management system is necessary for dividing torque between electrical motor and Internal Combustion Engine (ICE). The battery is a crucial component of hybrid electric vehicles and affects significantly the cost and the performance of the whole vehicle. The primar...

Approximate Dynamic Programming Recurrence Relations for a Hybrid Optimal Control Problem

This paper presents a hybrid approximate dynamic programming (ADP) method for a hybrid dynamic system (HDS) optimal control problem, that occurs in many complex unmanned systems which are implemented via a hybrid architecture, regarding robot modes or the complex environment. The HDS considered in this paper is characterized by a well-known three-layer hybrid framework, which includes a discret...

A Hybrid Dynamic Programming for Inventory Routing Problem in Collaborative Reverse Supply Chains

Inventory routing problems arise as simultaneous decisions in inventory and routing optimization. In the present study, vendor managed inventory is proposed as a collaborative model for reverse supply chains and the optimization problem is modeled in terms of an inventory routing problem. The studied reverse supply chains include several return generators and recovery centers and one collection...

Convex dynamic programming for hybrid systems

A classical linear programming approach to optimization of flow or transportation in a discrete graph is extended to hybrid systems. The problem is finite dimensional if the state space is discrete and finite, but becomes infinite dimensional for a continuous or hybrid state space. It is shown how strict lower bounds on the optimal loss function can be computed by gridding the continuous state ...

Publication date: 2007
